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Abstract — We address the capacity of a discrete-time memoryless 
Gaussian cliannel, wliere tlie cliannel state information (CSI) is neitlier 
available at tlie transmitter nor at tlie receiver. The optimal capacity- 
achieving input distribution at low signal-to-noise ratio (SNR) is precisely 
characterized, and the exact capacity of a non-coherent channel is 
derived. The derived relations allow to better understanding the capacity 
of non-coherent channels at low SNR. Then, we compute the non- 
coherence penalty and give a more precise characterization of the sub- 
Unear term in SNR. Finally, in order to get more insight on how the 
optimal input varies with SNR, upper and lower bounds on the non-zero 
mass point location of the capacity-achieving input are given. 

I. INTRODUCTION 

In wireless communication, the channel estimation at the receiver is 
not often possible due, for instance, to the high mobility of the sender 
or the receiver or both. Therefore achieving reliable communication 
over fading channels where the channel state information (CSI) is 
available neither at the transmitter nor at the receiver, is of a particular 
interest. When CSI is not available at both ends, computing the 
channel capacity, known as the non-coherent capacity, as well as 
computing the optimal input distribution achieving this capacity, 
for both SISO and MIMO channels, was addressed by Marzetta 
and Hochwald using a block fading channel [1]. The non-coherent 
capacity was also computed as a function of the number of transmit 
and receive antennas as well as the coherence time at high SNR in 
[2]. At a low SNR regime, it was also shown that to a first order of 
magnitude of the SNR, there is no capacity penalty for not knowing 
the channel at the receiver which is not the case at the high SNR 
regime [2] [3], [4]. Recently, this power efficiency at a low SNR 
regime or equivalently at a large channel bandwidth has motivated 
work towards a better understanding of the non-coherent capacity at 
a low SNR regime [5], [6], [7] for both SISO and MIMO channels 
using several fading models. 

For a discrete-time channel, computing the non-coherent capacity 
is a rather tedious task [8] [9]. The main difficulty in computing the 
non-coherent capacity relies on the fact that the capacity-achieving 
input distribution is discrete with a finite number of mass points, 
where one of them is located at the origin. The number of these 
mass points increases with the signal-to-noise ratio (SNR). Since no 
bound on the number of mass points with respect to SNR is actually 
available, it is very difficult to find closed form expressions for both 
the achievable capacity and the optimal input distribution for all SNR 
values. Fortunately, numerical computation of the capacity and the 
optimal input distribution has been made possible using the Khun- 
Tucker condition which is a necessary and sufficient condition for 
optimality, for of a SISO channel [8] and for a MIMO channel [9]. 
In [10], the channel mutual information at low SNR is computed 
and thus the capacity of a discrete time non-coherent memoryless 
channel is obtained through numerical optimization. However, the 
framework presented in [10] does not provide any insight on the 
capacity behavior at low SNR. 



In this paper, we analyze the capacity of a discrete time non- 
coherent memoryless Rayleigh fading SISO channel at low SNR. 
The main contributions of this paper are: 

1) Derivation of an analytical closed form of the channel mutual 
information at low SNR, which may also be considered as a 
lower bound on the channel mutual information for an arbitrary 
SNR value. 

2) Derivation of a fundamental relation between the capacity- 
achieving input distribution and the SNR value, from which 
an exact capacity expression is deduced at low SNR. 

3) Derivation of upper and lower bounds on the non-zero mass 
point location of the optimal input, which allow to deduce lower 
and upper bounds respectively on the non-coherent capacity at 
low SNR. 

The paper is organized as follows. Section II presents the system 
model. In section III, we derive a closed form expression of the 
channel mutual information at low SNR which is also a lower bound 
on the channel mutual information at all SNR values. The optimal 
input distribution as well as the non-coherent capacity are computed 
in Section IV. Numerical results are reported in Section V and Section 
VI concludes the paper. 

II. CHANNEL MODEL 

We consider a discrete-time memoryless Rayleigh-fading channel 
given by: 



r{l)=h{l)s{l)+w{l), 



« = 1,2,3, 



(1) 



where I is the discrete-time index, s{l) is the channel input, r{l) is the 
channel output, h{l) is the fading coefficient and w{l) is an additive 
noise. More specifically, h{l) and 'w{l) are independent complex 
circular Gaussian random variables with mean zero and variances 
(T^ and CT^, respectively. The input s{l) is subject to an average 
power constraint, that is i5[js(/)j^] < P, where E[.] indicates the 
expected value. It is assumed that the channel state information is 
available neither at the transmitter nor at the receiver. However, even 
though the exact values of h{l) and w{l) are not known, their statistics 
are, at both ends. Since the channel defined in (1) is stationary and 
memoryless, the capacity achieving statistics of the input s{l) are also 
memoryless, independent and identically distributed (i.i.d). Therefore, 
for simplicity we may drop the time index Z in (1). Consequently, the 
distribution of the channel output r conditioned on the input s can be 
obtained after averaging out the random fading coefficient h, yielding: 



fr\sir\s) 



■ exp 



(2) 



Noting that in (2), the conditional output distribution depends only on 
the squared magnitudes |s|^ and |r|^, we will no longer be concerned 
with complex quantities but only with their squared magnitudes. Let 



lLB{xi,a) = 



a [Ifili+fil + 1 + ^ . 2F1 f 1, ^, 1 + _ In (1 _ ^) _ In (1 + " ^ ) if xi > y^, 

[ x{ 1+xf 1+xf ^ ^\ ' xf ' x{' a J\ \ xfj \ ' (l+xf)(x{-a) J ^ ' 

if xi = y/a 



xi — {1 + x'i) lri(l + xi) — n 



xl + xf 



1 + xi — n cot ( ) + In 



xl + xf 



(9) 
(11) 



y = \r\^ ja^ and let x = |s|(T;j/o-tu. Conditioned on the input, y is 
chi-square distributed with two degrees of freedom: 



Sy\x{v\x) 



■ exp 



-V 



l + a;2 



(3) 



(l + a;2) 

with the average power constraint -B[a;^] < a, where a = Pcrf^/a^ 
is the SNR per symbol time. 

III. THE CHANNEL MUTUAL INFORMATION 
For the channel (3), the mutual information is given by [II]: 

fy\x{y\x) 



I{x;y) = [ [ fy\x{y\x)fx{x)\n J^^"" f ^ dxdy. 

J J J(,y;x)(ytX) 

The capacity of channel (3) is the supremum 
C = sup I{x; y) 

E\x^]<a 



(4) 



(5) 



over all input distributions that meet the constraint power. The 
existence and uniqueness of such an input distribution was established 
in [8]. More specifically, the optimal input distribution for channel 
(3) is discrete with a finite number of mass points, where one of them 
is necessarily null. That is, the capacity (5) is expressed by 



C ■ 



max 

E[x^] 



X y^Pi fy\xi{y\xi)ln 
^" i=o ■'0 



fylxM^i) 



T,jPjfy\xj{y\xj) 



dy, 



(6) 

where xo = < xi < X2 ■ ■ ■ < xn~i are the mass point locations 
and where po,pi . . . ,pjv-i their probabilities respectively. Since we 
focus on the low SNR regime, we may use in (6) a discrete input 
distribution with two mass points, where one of them is null, to 
obtain the optimal capacity at low SNR [8]. Furthermore, this on-off 
signaling also provides a lower bound on the non-coherent capacity 
for all SNR values. That is, a lower bound on the capacity may be 
expressed by: 

Clb = max lLBix;y), (7) 

E[x^]<a 

where Ilb{x; y) is a lower bound on the channel mutual information 
I{x;y) given by: 



lLB{x;y)=lLB{xi,pi) 



=}ZPi / /y|a:i(j/|a;i)ln 



fy\xi{y\Xi) 



T,iP3fy\xAy\xj) 



dy, 



and the average constraint power becomes: piXi < a. Note that the 
optimization problem in (7) deals with only two unknowns pi and 
xi. Furthermore, it is proven below that further simplifications can be 
obtained, using the fact that Ilb{xi,pi) is monotonically increasing 
in xi and thus the problem at hand may be reduced to a simpler 
maximization problem without constraint. We summarize this result 
in Lemma I. 

Lemma 1: The optimal capacity at low SNR and a lower bound 
on it for all SNR values is given by: 



Cl 



max lLB{xi,a), 

xi'>-t/a 



(8) 



where lLB{xi,a) is the channel mutual information for a given mass 
point location xi and a given SNR value a, given by (9) at the top 
of the page, and where 2^1 (•,•,-,•) is the Gauss hypergeometric 
function. Furthermore, the constraint power holds with equaUty and 
we have: E[x^] = pix\ = a. 

Proof: The proof is presented in [12]. ■ 
In Lemma I, the existence of a maximum for a given SNR value 
a is guaranteed by the continuity of lLB(xi,a) and the fact that it is 
bounded with respect to xi over the interval [\/a, 00 [. The existence 
of such a maximum was rigorously established in [12]. Clearly, the 
maximization (8) is reduced to solving the equation -^lLB{xi,a) 
for a given SNR value a. Ideally, an analytical solution would 
provide an insight as to how the non-coherent capacity and the on-off 
signaling vary with the SNR. However, solving such an equation for 
arbitrary SNR values is very ambitious since it comes to finding an 
analytical solution to involved transcendental equations. Nevertheless, 
it is of interest to focus on the low SNR regime to get the benefit 
of some advantageous simplifications in order to elucidate the non- 
coherent capacity behavior at low SNR. 

IV. NON-COHERENT CAPACITY AT LOW SNR 

In this section, we will use Lemma 1 to derive a fundamental 
analytical relation between the optimal input distribution at a low 
SNR regime and the particular SNR value a. We show in Theorem 
I that this fundamental relation holds up to an order of a strictly 
less than 2. As is shown below, the derived relation is very useful 
since it allows computing the optimal input distribution for a given 
SNR value a while providing a rigorous characterization as to how 
the non zero mass point locations and their probabilities vary with 
a. Moreover, the derived relation may be used to compute the exact 
non-coherent capacity at low SNR values. 

A. A fundamental relation between the optimal input and the SNR 

We present the fundamental relation between the optimal input 

distribution and the SNR value in the following theorem. 

Theorem 1: At a low SNR value a, the optimal input probability 
distribution for an order of magnitude of a strictly less than 2 
(©(a^"*^) for an arbitrary positive e), is given by: 



fx{x) = 



x\ with probability pi = , 

^1 

with probability po = 1 — pi , 



(10) 



where xi is the solution of the equation (11) given at the top of the 
page. Furthermore, the non-coherent channel capacity is given by: 



C7(o, x\) = a — a ■ 



\n{l + xl) H 



l + xi 



(12) 



Proof: The proof is presented in [12]. ■ 

Clearly, (II) is also a transcendental equation, for which determining 
an analytical solution is a very tedious task. Although it is very 
involved to derive an analytical solution of (11) in the form of 
Ki = /(a), it is of interest from an engineering point of view, to 



a = exp 



xiW{k, ^(xi)) - + TT cot (-^) + In (xl) + In (1 + xi) - 1 



sin (^)(-a;2 + In (1 + x^) + a;' In (1 + a;')) 
^{x) = ^ — exp 



-TTCOt I 



+ 1+^ 



(14) 
(15) 



resolve (11) numerically and obtain the optimal xi for a given SNR 
value a. One may then get the value of the non-coherent capacity from 
(12). Moreover, (11) provides some insight on the behavior of xi as 
a tends toward zero. For example, using (11), one may determine the 
limit of Xl as a tends toward zero. To see this, let M be this limit 
and let us assume that M is finite. From [6], [8], we know that for 
the optimal input distribution, the non-zero mass point location xi 
is greater than one. Thus, its limit as a tends toward zero is greater 
or equal than one: M > 1. Then, talcing the limits on both sides of 
(11) as a goes to zero yields: 



{1 + M^)\n{l + M'^) =0. 



(13) 



That is, if M is finite, it would be equal to zero, the unique solution 
to (13), but this is impossible since M > 1. Hence, consistently with 
[6], [8], lima^i = oo. 

Furthermore, we have found that (11) may be written more 
conveniently as (14) at the top of the page, with A: = — 1 if o < ao 
and fc = elsewhere, and where W(-,-) is the Lambert fimction, 
with ip{x) given by (15) at the top of the page too. Also, ao is the 
solution of (11) for xi = xo, where xo is the root of the equation 
ip{x) = — i. The number — ^ comes out in our analysis from the 
fact that it is the unique point shared by the principal branch of the 
Lambert function W{0, x) and the branch with k = —1, W{—l,x). 
That is VK(0, — i) = W(—l,—^). This guarantees the continuity 
of a in (14) for all a;i values. Numerically, we have found that 
Oo = 0.0582 and xo = V3.93388. Hence, (14) may also be viewed 
as a fundamental relation between the optimal input distribution 
and the SNR a for discrete-time non-coherent memoryless Rayleigh 
fading channels at low SNR. On the other hand, (14) provides the 
global answer as to how the non-zero mass point location of the 
optimal on-off signaling and the SNR are linked together. For this 
purpose, a simple analysis of (14) has been done and some important 
results are recapitulated in the following corollary. 

Corollary 1: According to (14), we have: 

1) For all o < oo, oo = 0.0582, a is an decreasing function with 
respect to xi and for all a > ao, a is an increasing function 

of Xl. 

2) For all a, xi > xo, where xo = V3. 93388. 

3) lim = 0. 

XI — *oo 

Corollary 1 agrees with [8] where it was shown using computer 
simulation that the non-zero mass point location passes through a 
minimum before moving upward. However, by specifying the edge 
point (I'o, Oo), Corollary 1 gives a more precise characterization con- 
cerning this peculiar behavior of the non-zero mass point locations. 
Furthermore, Corollary 1 also refines the lower boimd on si, xi > 1 
and derives xo as an improved lower bound on the non-zero mass 
point location at low SNR. Moreover, from (14), we may write: 

ln{a)+xl = xlW{k,ip{xi)) +7rcot (^) + ln{xi{l + xj)) - 1. 

Xl 

(16) 

It is then easy to check that the right hand side (RHS) of (16) is a 
decreasing function of for xi < xq, which yields an upper bound 
on xi: 

< -ln(o)+6, (17) 



where = In(oo) + Xq, which is again consistent with the upper 
bound derived in [6]. Note that the upper bound (17) is valid for all 
a < ao whereas the upper bound provided in [6] holds for a <^ ao 
for which ^o is negligible. On the other hand, combining (17) and 
the lower bound on xi provided in Corollary 1 one may obtain: 

o"a;o<o"x?<o"(4o-ln(o)). 



for all a > 0. That is: 



lim (a"xi) = 0, 



(18) 



(19) 



which means that o" tends toward zero faster than x\ does toward 
infinity. This result may also be used to gain further insight on the 
capacity behavior at low SNR. For instance, from (12), we may write 
the non-coherent capacity as: 



C(a) = o -h o(o), 



(20) 



ln(l + a:5=) 

7i 



where o{a) = —a ■ ^ 

meaning that the non-coherent capacity varies linearly wii o at low 
SNR and hence non-coherent communication at low SNR may be 
qualified as energy efficient communication. 

B. Energy efficiency and non-coherence penalty 

In general, the capacity of a channel including a Gaussian channel 
and a Rayleigh channel varies Unearly at low SNR [6], The difference 
between these chaimels in terms of capacity can only be explained 
by the sub-linear term o{a) in (20). The sub-linear term has been 
defined in [6] as: 

A(a) := a- C(a). (21) 

At low SNR, the sub-linear term A (a) is also related to the energy- 
efficiency. Let En be the transmitted energy in Joules per information 
nat, then we have: 

^ • C(o) = o. (22) 



Using (21), we can write: 

En _ 

1 - 



A{o) 



1 + 



A(o) 



(23) 



where the approximation holds if is sufficiently small. Note that 
if 0, then from (21) and (23), we have respectively: C(a) « 

a and ^ ~ 1, which implies that the highest energy efficiency 
of -1.59 (dB) per information bit could be theoretically achieved. 
For a Gaussian channel and a fading channel under the coherent 
assumption, the sub-linear terms are respectively given by [6]: 

1 



^awgn{o) = -a +o{a ) 



2 

Aco/ierent(o) — 2^ 



4l 2 , / 2x 

Jo -1-0(0 ) 



(24) 

(25) 



For a non-coherent Rayleigh fading charmel, the sub-linear term can 
be computed using (12): 



A(a) 



ln{l + xl) , H 
a • — — ^ + a 



l + xl 



(26) 



— Non-zero mass point location (simuialion) 
-♦-Non-zero mass point location given by (16) 
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Figure 1. Location of non-zero mass point versus the SNR value a (linear). 



Figure 2. Non-coherent capacity versus the SNR value a (linear). 



Note that at very low SNR and following (26), 



A(a) 



converges to 



zero making the non-coherent Rayleigh channel also energy efficient. 



However, as SNR increases, the convergence of to zero is slower 



than 



^AWGN (a) 



and 



This could be seen from (19) 



indicating that xi converges slower to infinity than a does to zero. 

In the range of SNR values of interest, we may define the non- 
coherence penalty per SNR as: 

Ccoherentio-) — C{a) ^^^^ 

a 

where Ccoherent IS the channel capacity under coherent assumption. 
Now, from [6], we can write Ccoherent as: 



Ceoherent{a) — a + 0{a) = a + o{a 



(28) 



for any 1 > a > 0. Recalling that the non-coherent capacity in (12) 
was obtained using series decomposition to an order strictly smaller 
than 2 (o(a for an arbitrary positive e), then combining (12) and 
(28), we derive the exact non-coherence penalty per SNR up to this 
order: 



Ccoherent{(^) C [d^ Ccohe 



c 



Ccoh 



ln(l + x^) 



+ a' 



1 + xi 



'-Am 



Note that the non-coherence penalty is equal to the sub-linear term 
A(a). Now using (19), dividing both sides of (29) by a", (a > 0) 
and taking the limit as a tends to zero yields: 



where ^ means 



Ccoherent{a) ~ C {o) ^ a 



1 + a 



lim - 

-►0 



(30) 

Inequality (30) 

indicates that not only non-coherence penalty is much greater than 
as was established in [5], but more precisely, it is much greater 
than a^+" since a^+° >• a^, 1 > a > 0. Again, this result is in full 
agreement with [6]. 

In this subsection, we have discussed exact closed forms of the 
optimal input distribution and the non-coherent capacity based on 
the fundamental relation (11) or equivalently (14). However, one may 
be interested in deriving simpler lower and upper bounds on these 
quantities in order to provide more insight on how they vary with the 
SNR value a. This is discussed next. 



C. Upper and lower bounds on the non-coherent capacity 

Considering (14), since we are interested in the low SNR regime, 
we assume for simplicity that a < ao- Thus the Lambert function in 
(14) is the branch with k — —1, W{—l,x). A lower bound on the 
non-coherent capacity is easily obtained by combining (17) and (12) 
and will be referred to as Clb^o)- We now derive the lower bound 
on the optimal non-zero mass point location and the upper bound on 
the non-coherent capacity in Theorem 2. 

Theorem 2: At low SNR values a, a lower bound on the optimal 
non-zero mass point location is given by: 

y 



Xl,LB 



\ 



(31) 



where y 



1 + \n-. Furthermore, an upper bound on the non- 



coherent capacity can be obtained from (12) as: 

CuB{a) = C(a,xi,Ls) (32) 
Proof: The proof is presented in [12]. ■ 

V. Numerical Results AND Discussion 

The curves in Fig. 1 show respectively, the non-zero mass point 
location of the capacity-achieving input distribution xi obtained 
using maximization (8), and the one obtained using relation (11) 
or equivalently (14). As can be seen from Fig. 1, the two curves 
are undistinguishable at low SNR, confirming that (15) is exact at 
low SNR. As the SNR increases, a small discrepancy between the 
two curves starts to appear. This is expected since (14) holds for up 
to an order of magnitude strictly smaller than 2 and thus for small 
SNR values, (but not smaller than about 2.10"^), a discrepancy may 
appear. Nevertheless, even for an SNR greater than 2.10"^, the curve 
obtained using (14) is very instructive especially as it follows the 
same shape as the one obtained by simulation results. An interesting 
future work would be to use (15) in order to understand why a 
new mass point should appear as the SNR increases. It should be 
mentioned that the discrepancy observed in Fig. 1 may be rendered 
as small as desired using high order series expansion. However, the 
analysis would be unrewardingly too complex. 

Figure 2 depicts the non-coherent capacity curves. Again, the curve 
obtained by computer simulation and the one obtained using (12) 




Figure 3. Non-coherence penalty per SNR versus the SNR value a (linear). 

are undistinguishable. More interestingly, the discrepancy observed 
at not very low SNR values in Fig. 1 has vanished, implying that 
the capacity is not very sensitive to the non-zero mass point location. 
Also shown in Fig. 2 is the linear approximation C(a) — a, which 
is an upper bound on the capacity. As can be noticed in Fig. 2, 
the linear approximation follows the same shape as the exact non- 
coherent capacity curves at low SNR and becomes quite loose for 
SNR values greater than 10~^. This implies that the sub-linear term 
defined in (21) is much more important at these SNR values. This 
can be seen in Fig. 3 where we have plotted the non-coherence 
penalty percentage given by (29). Figure 3 confirms that there is no 
substantial gain in the channel knowledge in a capacity sense at very 
low SNR, thus indicating that non-coherent communication is almost 
as power-efficient as AWGN and coherent communications. As the 
SNR increases, a non-coherence penalty begins to appear reaching 
up to 70%. 

The derived upper and lower bounds on the non zero mass point 
locations given respectively by (17) and (31) as well as the bounds 
derived in [6] are plotted in Fig. 4 along with the exact curves at low 
SNR. As can be seen in Fig. 4, the upper bound in [6], albeit tighter 
than (17), crosses the exact curves at about 2.10^^. At these not 
so low SNR values, the derived bound in [6] is no longer an upper 
bound, consistently with our discussion in subsection IV-A. On the 
other hand, the lower bound (31) is tighter than the one derived in 
[6] for all SNR values. 

VI. Conclusion 

In this paper, we have addressed the analysis of the capacity 
of discrete-time non-coherent memoryless Gaussian channels at low 
SNR. We have computed explicitly the channel mutual information 
at low SNR which is also a lower bound on the channel mutual 
information, albeit not necessarily at low SNR values. Using the 
derived expression of the channel mutual information, we have been 
able to provide a fundamental relation between the non-zero mass 
point location of the capacity-achieving input distribution and the 
SNR. This fundamental relation brings the complete answer about 
how the optimal input distribution varies with the power constraint 
at low SNR. It also provides an analytical explanation on what was 
previously observed through computer simulation in [8] about the 
peculiar behavior of the non-zero mass point location at low SNR 
values. The exact non-coherent capacity has been derived and insights 



Figure 4. Exact non-zero mass point locations and the derived upper and 
lower bounds as well as those reported in [6] versus the SNR value a (linear). 

on the capacity behavior which can be gained through functional 
analysis has been shown. 

In order to better understand how the non-zero mass point location 
varies with the SNR, we have also derived lower and upper bounds 
which have been compared to recently derived bounds. The newly 
derived lower bound is tighter for all SNR values of interest, whereas 
somewhat looser, the upper bound was shown to hold for larger SNR 
values. 
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